Corpus: por-cv_web_2013_10K


4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of distinct word N-grams (N = 1 to 5) at the end of the first K sentences

K          # of words   # of bigrams   # of trigrams   # of 4-grams   # of 5-grams
100                95             97              97             98             98
1000              855            949             976            989            990
10000            5803           8159            9195           9451           9533
100000           5804           8160            9196           9452           9534
1000000          5804           8160            9196           9452           9534
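
A minimal sketch of how counts like these could be produced, assuming one sentence per string and whitespace tokenization. The function name `ending_ngram_counts` and the rule of skipping sentences shorter than N tokens are illustrative assumptions, not the method used to generate this page.

```python
def ending_ngram_counts(sentences, ks, max_n=5):
    """Count distinct sentence-final word N-grams after the first K sentences.

    Returns {K: [count for N=1, ..., count for N=max_n]}.
    Assumption: a sentence with fewer than N tokens contributes no N-gram.
    """
    seen = [set() for _ in range(max_n)]  # one set of N-grams per N
    pending = sorted(ks)
    results = {}
    for index, sentence in enumerate(sentences, start=1):
        tokens = sentence.split()  # assumed whitespace tokenization
        for n in range(1, max_n + 1):
            if len(tokens) >= n:
                seen[n - 1].add(tuple(tokens[-n:]))
        # Record a snapshot whenever a requested K has been reached.
        while pending and pending[0] <= index:
            results[pending.pop(0)] = [len(s) for s in seen]
    # Any K larger than the corpus gets the final totals (the counts plateau).
    for k in pending:
        results[k] = [len(s) for s in seen]
    return results


if __name__ == "__main__":
    demo = ["a b c .", "x b c .", "a b c .", "d ."]
    for k, counts in ending_ngram_counts(demo, ks=(2, 3, 100)).items():
        print(k, counts)
```

Under this sketch, a K larger than the number of sentences simply repeats the final totals, which would be consistent with the rows above no longer changing once K exceeds the corpus size.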


[Figure: Zipf's diagram for sentence endings (Gnuplot diagram)]
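
The data behind a Zipf diagram is a list of (rank, frequency) pairs, with items sorted by descending frequency. The following is a rough sketch for sentence-final words, again assuming whitespace tokenization; `zipf_points` and `to_plot_lines` are hypothetical helper names, and the two-column text output is only one plausible input format for a plotting tool such as Gnuplot.

```python
from collections import Counter


def zipf_points(sentences):
    """Return [(rank, frequency), ...] for sentence-final words, rank 1 = most frequent."""
    endings = (s.split()[-1] for s in sentences if s.split())
    frequencies = sorted(Counter(endings).values(), reverse=True)
    return list(enumerate(frequencies, start=1))


def to_plot_lines(points):
    """Format the points as 'rank<TAB>frequency' lines, one pair per line."""
    return "\n".join(f"{rank}\t{freq}" for rank, freq in points)


if __name__ == "__main__":
    demo = ["a b", "c b", "d b", "e f", "g f", "h i"]
    print(to_plot_lines(zipf_points(demo)))
```

A Zipf diagram is conventionally drawn with both axes on a logarithmic scale, so that a power-law relation between rank and frequency appears as a roughly straight line.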

Computed in 2190 ms on 2018-06-02 at 16:27